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times and a combined forward-reverse search of 
mass spectral reference spectra. The primary ob- 
jective of this study is to develop computational 
pattern recognition procedures for the identifica- 
tion of chemical classes and, if possible, individual 
compounds from routine GC-MS data files. The 
additional information on nontarget compounds 
would supplement compound identifications ob- 
tained from the target list. The target set of com- 
pounds investigated consisted of 78 substituted 
benzenes, haloalkanes and haloalkenes. 



Methods 

SIMCA Pattern Recognition 

SIMCA pattern recognition was developed by 
Wold and coworkers [1] for application to classifi- 
cation problems in chemistry. The technique is 
based on the modeling of chemical classes by dis- 
joint principal component models. Once the class 
models have been determined, objects are classified 
by fitting their data to the class models. A standard 
deviation for each model is calculated from the 
residuals which represents a class tolerance level 
around the model in measurement space. 
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Introduction 

The most frequently used method of analysis of 
trace organic pollutants in ambient air is gas chro- 
matography-mass spectrometry (GC-MS). Present- 
ly, target compounds in air monitoring samples are 
identified from gas chromatographic retention 



Data Analysis 

In this study an IBM PC-XT microcomputer 
with 640K memory was used. A modified version 
of the SIMCA program was used for the majority 
of the pattern recognition calculations. The low 
resolution mass spectra of 78 compounds were ob- 
tained from the EPA-NIH Mass Spectral Library 
on an INCOS data system (see table 1). The GC- 
MS data files were from routine calibration and 
analysis performed by the Environmental Monitor- 
ing Systems Laboratory, U.S. Environmental Pro- 
tection Agency, Research Triangle Park, NC. 

The data were preprocessed by taking the auto- 
correlation transform of the mass spectra for mass 
lags less than 1(K). In the initial stages of class mod- 
eling, the autocorrelation transformed spectra for 
the 78 training compounds were examined for class 
separation with two and three dimensional princi- 
pal components projections of the training data, 
Three different groups were found: nonhalo- 
genated benzenes; chloroaromatics, chloroalkanes, 
chloroalkenes; and bromocarbons. 

Principal components models were then derived 
for each class [2]. None of the class models 
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Table 1. Compounds included in this pattern recognition study 



Compound 



Class' 



p -Xylene 

1,3, 5-Trimethylbenzene 
Isopropylbenzene 
n-butylbenzene 
l-Methyl-4-isopropylbenzene 
o-Dichlorobenzene 2 

/!-Dichlorobenzene 2 

l-Chloro-2-methylbenzene 2 

l-Chloro-4-methylbenzene 2 

p-Chlorostyrene 2 

1,1-Dichloroethane 2 

1,1,1,2-Tetrachloroethane 2 

1,2,3-Trichloropropane 2 

3-Chloro-l-propene 2 

2-Chlorobutane 2 

1,3-Dichlorobutane 2 

1,4-Dichlorobutane 2 

l,4-Dichloro-2-butene {cis) 2 

3,4-Dichlorobut-l-ene 2 

1,4-Dioxane 

l-Chloro-2,3-epoxypropane 2 

2-Chloroethoxyethene 2 

Acetophenone 
Benzonitrile 
Benzene 
Toluene 
o-Xylene 
m -Xylene 
Ethylbenzene 
Styrene 

Chlorobenzene 2 

Bromobenzene 3 

m-Dichlorobenzene 2 

l-Chloro-3-methylbenzene 2 

Chloroform 2 

Carbon tetrachloride 2 

Bromochloromethane 3 

Bromotrichloromethane 3 



Compound 



Class^ 



Dibromomethane 


3 


Bromoform 


3 


1 ,2-DichIoroe thane 


2 


1, 1,1-Trichloroethane 


2 


1 , 1 ,2-Trichloroethane 


2 


1 , 1 ,2,2-Tetrachloroe thane 


2 


Pentachloroethane 


2 


1,1-Dichloroethane 


2 


Trichloroethene 


2 


Tetrachloroethene 


2 


Bromoethane 


3 


1 ,2-Dibromoethane 


3 


l-Chloropropane 


2 


2-Chloropropane 


2 


1,2-Dichloropropane 


2 


1 ,3-Dichloropropane 


2 


l-Bromo-3-chloropropane 


3 


1 ,2-Dibromopropane 


3 


2,3-Dichlorobutane 


2 


Tetrahydrofuran 





Benzaldehyde 


1 


1 -Bromo-1 -chloroethane 


3 


2,2-Dibroraopropane 


3 


2-Bromopropene 


3 


2-Bromopropane 


3 


3-Bromopropene 


3 


l-Bromopropane 


3 


1-Chlorobutane 


2 


l-Bromo-2-chloroe thane 


3 


Bromodichloromethane 


3 


1-Bromobutane 


3 


2,2-Dichlorobutane 


2 


Dibromochloromethane 


3 


1 , 1 ,2-Trichloropropane 


2 


1 , 3-Dibromopropane 


3 


1, 1,1,2-Tetrachloropropane 


2 


1 ,2,2,3-Tetrachloropropane 


2 


1 ,3-Dibromobutane 


3 


1,1,2, 3-Tetrachloropropane 


2 


1 ,4-Dibromobutane 


3 



'Class l=nonhaloaromatics; class 2 = chlorocarbons including aromatics and aliphatics; class 
3 = bromo- and bromochlorocarbons; class 0=internal standards and dioxane and tetrahydrofuran. 



required more than three principal components. 
The largest number of autocorrelation coefficients 
relevant to a specific class was 40 in class 3. 
Eighty-six percent of the training set compounds 
were assigned to their correct class as a first choice 
and were within two standard deviations of the 
models. 

Hierarchical Qassification Scheme [2] 

Once a class assignment is made, further identifi- 
cation of each spectrum can be made. This is done 



by calculating the distance, using the entire spec- 
trum in autocorrelation space, of each compound 
to its three nearest neighbors in its assigned class. 
To obtain a specific structural assignment, the nor- 
malized correlation coefficient of the unknown 
mass spectrum with that of each of its three nearest 
neighbors is obtained. If the unknown is not identi- 
fied as one of the three nearest neighbors in the 
first class assignment, the correlation coefficients of 
the three nearest neighbors in the second SIMCA 
class assignment are compared. 
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Results 

GC-MS Calibration Samples 

Three direct injection calibration samples, which 
contained all but one of the training set com- 
pounds, were analyzed [2], Of the 77 compounds 
observed in the calibration runs, 84% were correct- 
ly identified. Of the 13 compounds not identified, 
three were correctly assigned by chemical class. 

GC-MS Field Samples 

The GC-MS data files for three field samples 
were also analyzed [3]. The identities of the target 
compounds were determined by using both GC re- 
tention times and a combination of forward and re- 
verse spectral matching techniques with stringent 
matching parameters. The identification of other 
compounds not on the target list was based on a 
Finnigan search technique. The application of the 
pattern recognition scheme to the transformed data 
for the target compounds resulted in 88% correct 
classification. The compound identification results 
were 85% accurate. 

There were 75 different nontarget compounds 
identified in 120 occurrences in the three samples. 
The classification results agreed very well for the 
two class 1 and class 2 spectra. However, a very 
large number of alkanes and allienes were incorrect- 
ly classified as chlorocompounds. Further details 
of this study are given in references [2], [3], and [4]. 

Although the research described in this article 
has been funded by the U.S. Environmental Pro- 
tection Agency under Cooperative Agreement 
CR-811617 with the University of Illinois at 
Chicago, it has not been subjected to Agency re- 
view. The mention of commercial products does not 
constitute endorsement or recommendation for use. 
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Annular Denuders and Filter Packs 
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Measurements of acidic (e.g., SO2, HNO3) and 
basic gases (e.g., NHj) that coexist with fine parti- 
cles (<2.5 Jim) are used in models to assist in deter- 
mining the origin and age of aerosols. Bias associ- 
ated with each measurement method used to obtain 
this air quality data can degrade the real correla- 
tion between species. In addition, the sensitivity of 
most instrumental methods to measure SO2, NH3, 
HNO3 or HNO2 is limited to measurements at con- 
centration levels above 2-5 ppb. Measurement of 
HNO3 has also proven to be more difficult due to 
losses of HNO3 in the sampling inlets used in some 
of the measurement procedures. Difficulties in dif- 
ferentiating atmospheric HNO3 from HNO3 pro- 
duced by the dissociation of NH4NO3 during sam- 
pling, further complicates monitoring procedures. 

Possanzini et al. [1] described the development 
of an annular denuder system which removes reac- 
tive gases (e.g., HNO3, SO2) from air samples an 
order of magnitude more efficiently per unit length 
and at lower Reynolds number than open tubular 
denuder designs. Recently we developed an im- 
proved version of the annular denuder that incor- 
porates several important features to minimize 
losses of key species during sampling and reduce 
the possibility of leaks in the components that join 
the various parts of the system. 

To demonstrate the applicability of this im- 
proved design of the annular denuder, a series of 
field studies were conducted in Research Triangle 
Park, NC during the fall of 1986 and winter of 
1987. An annular denuder system (fig. 1) is com- 
posed of four components: an inlet (to remove 
large particles), coated denuders (to collect the 
acidic and basic gases), filter pack (to collect fine 
particles and HNO3 that may evaporate from the 
filter), and a flow controller-pump assembly. In 
this study a new glass Teflon-coated impactor inlet 
was designed with a short tube extending below 
the impactor to prevent large particles and rain 
droplets from entering the impactor. With the ex- 
ception of the impactor surface, the entire inlet is 
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